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Abstract —In this paper, we study virtual full-duplex (FD) 
buffer-aided relaying to recover the loss of multiplexing gain 
caused by half-duplex (HD) relaying in a multiple relay network, 
where each relay is equipped with a buffer and multiple antennas, 
through joint opportunistic relay selection (RS) and beamforming 
(BF) design. The main idea of virtual FD buffer-aided relaying is 
that the source and one of the relays simultaneously transmit 
their own information to another relay and the destination, 
respectively. In such networks, inter-relay interference (IRI) is 
a crucial problem which has to be resolved like self-interference 
in the FD relaying. In contrast to previous work that neglected 
IRI, we propose joint RS and BF schemes taking IRI into 
consideration by using multiple antennas at the relays. In order 
to maximize average end-to-end rate, we propose a weighted 
sum-rate maximization strategy assuming that adaptive rate 
transmission is employed in both the source to relay and relay to 
destination links. Then, we propose several BF schemes cancelling 
or suppressing IRI in order to maximize the weighted sum- 
rate. Numerical results show that our proposed optimal, zero¬ 
forcing, and minimum mean square error BF-based RS schemes 
asymptotically approach the ideal FD relaying upper bound when 
increasing the number of antennas and/or the number of relays. 

Keywords — Full-duplex, buffer-aided relaying, inter-relay inter¬ 
ference, relay selection, beamforming 

I. Introduction 

Since cooperative relaying can improve both spectral effi¬ 
ciency and spatial diversity, it is a promising core technology 
for next-generation wireless communication networks. So far, 
most studies have considered half-duplex (HD) relaying based 
on two-phase operation where a source transmits data to 
relays at the first time slot and the relays forward it to a 
destination at the second time slot ID, 0. However, such HD 
relaying causes a loss of multiplexing gain expressed as an 
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one-half pre-log factor. To overcome the loss of multiplexing 
gain, several practical full-duplex (FD) relaying solutions have 
been studied |3l- lfl2l . Since strong self-interference is a main 
problem which has to be resolved in FD relaying, the previous 
work primarily focused on self-interference cancellation based 
on antenna separation techniques in the wireless propagation 
domain and signal cancellation techniques in the analog circuit 
and digital domains. Although these studies showed feasibility 
of FD relaying using small-scale wireless communication 
devices such as WiFi and IEEE 802.15.4, the technology is 
still premature for cellular communications, which require 
additional cancellation gains due to practical limitations such 
as varying center frequencies, bandwidth, and circuit imper¬ 
fections. 

In order to mitigate the loss of multiplexing gain in HD 
relaying, successive relaying protocols have been proposed for 
a two-relay network ED-ED and multiple-relay networks 
ED-Eo). In these protocols, two relays take turns acting 
as receivers and transmitters successively and a source and 
a transmitting relay transmit their own information simul¬ 
taneously. Here, the source transmits new information and 
the relay transmits previously received information. The main 
issue for such successive relaying protocols is to efficiently 
handle inter-relay interference (IRI) from the transmitting 
relay to the receiving relay. Towards this end, successive 
interference cancellation (SIC) and/or sophisticated coding and 
joint decoding techniques have been employed in the literature. 
However, the SIC requires strong interference scenarios and 
the joint decoding requires high computational complexity. 
Furthermore, although the successive relaying asymptotically 
achieves the spectral efficiency of the FD relaying with respect 
to the number of channel uses, it requires a sufficiently long 
block length (equivalently, coherence time) over slow fading 
channels. 

Employing a buffer at the relay, such long block length 
constraints can be relaxed. Focusing on these advantages, 
buffer-aided relaying has been proposed in a three-node net¬ 
work ED-131. The key idea is an opportunistic relaying 
mode selection (buffering or forwarding) according to channel 
conditions. HD buffer-aided relaying can achieve up to two¬ 
fold spectral efficiency under asymmetric channel conditions 
between {<S—>-7?.} and {'IZ—t'D} links, compared to HD re¬ 
laying without buffer. Additionally, bidirectional buffer-aided 
relaying with two-way traffic ESJ-EEl, buffer-aided relaying 
over dual-hop broadcast channels Il29l and a shared relay 
channel with two source-destination pairs |30l have been 
studied. By extending to multiple-relay networks, several op¬ 
portunistic relaying schemes, which exploit the best HD buffer- 
aided relay, have been proposed GD-G3. Ikhlef et al. ED 
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have proposed a max — max relay selection (MMRS) scheme, 
which selects the best {5— >1Z} and {71—>T>} relays with the 
maximum channel gains. However, the MMRS scheme does 
not fully take advantage of the benefits of buffer-aided relaying 
since it maintains the two-phase operation. Therefore, Krikidis 
et al. If32l have proposed a max —link relay selection (MLRS) 
scheme, which selects the best relaying mode as well as the 
maximum channel gain. 

Most recently, Ikhlef et al. m have proposed a space 
full-duplex max — max relay selection (SFD-MMRS) scheme, 
which mimics the FD relaying by utilizing the best receiv¬ 
ing and transmitting relays operating simultaneously. In this 
scheme, they did not consider IRI by assuming fixed-relays 
with highly directional antennas. However, this assumption 
does not always hold and it is hard to be practically realized 
as the number of relays increases. With consideration of IRI, 
Kim and Bengtsson lf34l proposed a virtual FD buffer-aided 
relaying scheme based on opportunistic relay selection (RS) 
with zero-forcing beamforming (BF) for IRI cancellation in 
order to maximize average end-to-end rate assuming adaptive 
rate transmission. Nomikos et al. E3, ED have proposed 
a buffer-aided successive opportunistic relaying (BA-SOR) 
scheme employing SIC at the receiving relay for fixed rate 
transmission. In l35l . even if it partially overcame the strong 
interference requirement of SIC through power allocation at 
the source and relays, the main objective was to minimize the 
total energy expenditure. In l36l . the average end-to-end rate 
of the BA-SOR scheme, which has been originally devised for 
fixed rate transmission, is numerically shown for adaptive rate 
transmission. However, a fixed low SIC threshold, ro (e.g., 2 
bps/Hz), has been applied even for adaptive rate transmission 
whereas the threshold value should be set to the information 
rate of the {71—>V} link. Thus, this yields an optimistic result 
in terms of the average throughput. 

In this paper, our main goal is to approach the average 
end-to-end rate of ideal FD relaying even in the presence of 
IRI. To this end, we propose transmission schemes based on a 
joint RS and BF design utilizing multiple buffer-aided relays 
and multiple antennas at the relays. For the joint RS and BF 
design, we first propose a weighted sum-rate maximization 
using instantaneous channel and buffer states for achieving 
the average end-to-end rate maximization. Then, we separately 
design linear BF for each (receiving and transmitting) relay 
pair, which cancels or suppresses IRI, and optimal RS for 
maximizing the weighted sum-rate based on the beamformers 
found for each relay pair. To focus on maximizing the average 
end-to-end rate, we employ adaptive rate transmission at the 
source and relays (i.e., channel state information at transmitter 
for both nodes) and consider delay-tolerant applications. Our 
main contributions in this work are summarized as follows: 

• A new RS criterion based on a weighted sum of instan¬ 
taneous rates is proposed to maximize the average end- 
to-end rate in a virtual FD buffer-aided relaying network 
with adaptive rate transmission. 

• Various transmit and receive BF design strategies at the 
multiple antenna relays are proposed in order to cancel 
or suppress IRI. 

• We show that joint RS and BF schemes achieve the 


ideal FD relaying bound in terms of the average end- 
to-end rate asymptotically with increasing the number 
of antennas and/or the number of relays. 

• Compared to our previous work (34), we propose more 
practical joint RS and BF schemes which can support 
non-identical channel conditions, including an iterative 
optimal BF-based RS scheme which achieves the best 
average end-to-end rate. Moreover, we provide extensive 
numerical results including average end-to-end rate, av¬ 
erage delay, effect of IRI intensity, behavior of optimal 
weight factors, and effect of finite buffer size. 

The rest of this paper is organized as follows. In Section HU 
the system model is presented. The instantaneous rates and 
average end-to-end rate of a buffer-aided relaying network are 
described in SectioniHll Buffer-aided joint RS and BF schemes 
considering IRI are proposed in Section [IV] In Section [V] the 
performance of the proposed schemes are evaluated through 
simulations. Finally, conclusive remarks and future work are 
provided in Section [VT] 

II. System Model 

In this paper, we consider a source, S, and a destination, V, 
which have a single antenna, and K buffer-aided relays with M 
antennas each (e.g., in Fig. [T| M = 2). Denote the set of HD 
buffer-aided decode-and-forward relays by fC = {1 ,... ,K}. 
We assume that there is no direct path between the source 
and destination as in the related literature im-ES), EMM). 
This system model can be regarded as an example of relay- 
assisted device-to-device communications where the source 
and destination are low-cost devices with some limitations 
such as a single antenna. The source is supposed to always 
have data traffic to transmit. In addition, let h^,, h jx>, and 
H ji, i,j £ K, denote the channel coefficient vectors and 
matrices of {<S—>■/}, {j-t-77}, and {.)—>*} links, respectively. 
The channel fading is assumed to be stationary and ergodic 
and in the asymptotic analysis and numerical examples, we will 
make the additional assumption that all the channel coefficients 
follow circular symmetric complex Gaussian distributions, 
hsi ~ C/V(0, cr| K .I), h jx> ~ CAf(0, (Tti-xX), and vec[H,i] ~ 
CAf(0, cr^ .^.I) where vec[-] denotes the vectorization of a 
matrix. 

In order to mimic FD relaying, {<S—>-75.} and {TZ— 
transmissions are performed simultaneously by using the best 
pair of receiving and transmitting relays as in lf33) -ll36l. To 
this end, the receiving relay decodes the data received from the 
source and stores it in its buffer, while the transmitting relay 
encodes data from its buffer and sends it to the destination. 
For a given selected relay pair (i, j), i ^ j, the received signal 
vector at the receiving relay i is 

y, — Ii.S'/ 't's T FIj, Xy T n, — 1 i,Sj J's T T F / T ri,. 

( 1 ) 

where hg, £ C Mx 1 denotes the channel vector from the 
source to the i-th relay, Hjj £ C MxA/ denotes the inter¬ 
relay channel matrix from the j-th relay to the z-th relay, 
xs denotes the transmitted data symbol from the source, and 
x, = w jXj denotes the transmitted data symbol vector from 
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Fig. 1. System Model: a single source, a single destination, and multiple 
relays with buffer and multiple antennas (e.g., M = 2). 


the j-th relay where w j = [wj ,..., Wj 1 } 1 and Xj represent the 
transmit BF vector of the j-th relay and the transmitted data 
symbol of the j-th relay, respectively. Here, E [|ss| 2 ] < Ps 
and E [\x 3 1 2 ] < Pjz where Ps and Pr, denote the maximum 
transmit powers of the source and the relay, respectively, and 
11Wj 11 = 1 for j, j £ 1C where || • || denotes the 2-norm, n,; 
denotes an additive white Gaussian noise (AWGN) vector with 
zero mean and covariance er 2 I, i.e., n, ~ CA/”(0 ,ct 2 I). For a 
given selected relay pair (j, j), i ^ j, the received signal at 
the destination is 

yv j) = h fD x j + ™v = jXj + n-D, (2) 

where hjz> £ C AIx1 denotes the channel vector from the j-th 
relay to the destinatiorQ. (-) H denotes the Hermitian transpose, 
and nx> denotes AWGN with zero mean and variance <j\, i.e., 
n-D ~ CA/"(0,ct 2 ). 


III. Instantaneous and Average Data Rates of a 
Buffer-Aided Relaying Network 

Using a linear receive BF vector u, = [uj ,..., uf 1 } 1 - at the 
j-th receiving relay, with ||uj|| = 1, the received signal after 
the receive BF becomes 

r { i' 3) = uf yf j) = ufh Sl x s + u," ll,,w ; .r y + fn, (3) 


respectively, as: 


_ Ps\ufh Sl \ 2 
Sl 1 +Pw|ufHjiWj| 2 ’ 

= PTjIhj^Wjl 2 , 


(4) 

(5) 


where ps = Ps/&n an d Pn = Pnl&n- Let Bi(t) denote 
the number of bits in the buffer normalized by the number 
of channel uses of the j-th relay at the end of time slot 
t. Assuming a Gaussian codebook and information theoretic 
capacity achieving coding scheme, and taking the buffer state 
at the receiving relay i into consideration, the instantaneous 
rate of the {5— >i} link is 


Csf\t) = min jlog 2 (l +7sJ j) ( i )) , B max - Bi(t - 1) j , 

( 6 ) 


where min{-, •} denotes the minimum value of arguments, 
Bmax denotes the maximum buffer size, and the buffer of the 
j-th relay is updated by 


B i (t)=B i (t-l) + C& j) (t). 

For the transmitting relay j, the instantaneous rate of link 
{j— >V} in time slot t, is 

Cjv\t) = min jlog 2 (l+7 ,Bj(t- 1)} , (7) 

where the buffer of the j-th relay is updated by 

The average received data rate at the destination, over a time 
window of length T, is given by 

o »=?E c j;s !£“”(*>. («) 

t=i 

where i(t) and j(t) denote the indices of the selected receiving 
and transmitting relay indices in time slot t, respectively. 
Similarly, the average transmitted data rate at the source is 

Cs = ^J2 c sm m) ^- ( 9 ) 

t= 1 


IV. Buffer-Aided Joint Relay Selection and 
Beamforming Schemes in the Presence of 
Inter-Relay Interference 


for given relay pair (i,j), where h t = ufn j ~ CA/"(0,cr 2 ). 
From ((3} and Q. the instantaneous received SINR/SNR for 
the {iS^-j} and {j —>'Z2} links at time slot t are expressed. 


’For notational convenience, we define hjx> as the complex conjugate chan¬ 
nel vector differently from the definition of I 15 ; , i.e.. lifp = - D ... ., hj£,\ 
while h Si = [h^ v h^ i ] T . 


Our objective is to maximize the average data rate given 
in ®, through a joint RS and BF design. Optimizing the 
minimum of the instantaneous rates,_ which corresponds to 
maximizing a lower bound on min{Cs, Cx>} lf2P . was pro¬ 
posed in 071 as an HD best relay selection criterion. Since this 
is suboptimal in the presence of IRI, we here propose an al¬ 
ternative approach, by formulating the following optimization 
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problem. 


max Cx> 

{“i(0 


( 10 a) 


s. t. 


1 

T 


t= 1 


-Cl 


(>WjW) 

kV 


(t) 


> 0, Vfc € /C, (10b) 


l|u i( t)|| < 1 , ||w i(t) || < 1 , ( 10 c) 

(lOd) 


In order for the relay buffers to stay non-empty, we have here 
added the constraints (1 1 Obb (the buffer is absorbing). Obvi¬ 
ously, we want (1 1 Obb to hold with equality (buffer stability), 
but the inequality formulation is exploited below to determine 
constrains on the weighting parameters. Problem (fTTTb is non- 
convex in the beamforming vectors and combinatorial in the 
relay selection. For tractability, we therefore use a Lagrange 
relaxation [38), studying the partial Lagrangian with dual 
variables 07 , corresponding to the constraints (1 1 Obb . 


keK. \ t =1 


W ~ ^ 


O(f)ifW) 


kV 



(ID 


Similarly to ED, and for the moment assuming unbounded 
buffers, it can be shown that the probability that Bj(t — 1) < 

log 2 ^1 + (f)'j goes to zero, if the channel fading is 

stationary and (11 Obl i holds. Therefore, the RS and BF can be 
optimized separately for each time step. Collecting the terms 
corresponding to time step t, the primal variables optimizing 
£(■) are given by 


max ctiCgf^ + (1 - otj)C^ ( 12 a) 

{u»,w 

s. t. {TOD & {USD ( 12 b) 


and it follows from the inequality in (11 Obb that ctk > 0 at the 
optimum. A similar argument, instead optimizing the average 
transmit rate at the source, under constraints that the average 
relay buffers are non-increasing, with Lagrange multipliers 
1 — o : k, gives exactly the same criterion (1 1 2 b . but with the 
constraints that 1 — otk > 0. The conclusion of this Lagrange 
relaxation is therefore that the optimal RS and BF strategy can 
be done separately for each time step based on instantaneous 
information and has the form ( 1 1 2 b . for some choice of weight 
factors (dual variables) ctk with 0 < ctk < 1 . 

One possible approach to determine the optimal ctk is to use 
subgradient optimization. As shown in [39), a subgradient for 
ctk is given by the left hand side of (1 1 Obb . However, to avoid 
having to run the system T time steps between each update 
of ctk, we propose to replace the time average in (11 Obb by an 
exponentially weighted sum 




r —0 


( r l - 


( 13 ) 


where the forgetting factor A is close to, but less then one, and 
update oti at each time step using 

a fc (f) = min{l,max{0,afc(f - 1) - //(f)A Sfc (f)}} (14) 

for some suitable choice of step size Note that As fc (f) 
only will be an approximation of the true subgradient, since it 
is influenced by several earlier values of ak (t) and also since it 
is stochastic. Still, numerical experiments have always shown 
convergence. 

Since keeping all the buffers stable is a necessary optimality 
condition, an alternative approach is to use the back-pressure 
algorithm ED, ED. Assume a stationary stochastic arrival 
process with a rate less than or equal to the achievable 
source-destination rate and let B$ (t) denote the source buffer 
occupancy at time t. Then, a standard derivation of the back¬ 
pressure algorithm, using the Lyapunov function Bg(t) + 
Bf(t), gives the following design criterion, 

max (B s {t) - Bi{t))C™ + (15a) 

{Ui.Wj.Ij} 

s.t. ([TOD & ( fTOdb (15b) 

Dividing this criterion by Bs(t) gives an expression of exactly 
the same form as (fl2) . if we set oti = 1 — Bi(t)/Bs(t). 
Therefore, an alternative approach to determine the optimal 
is to run the back-pressure algorithm until it reaches 
stationarity, and use a time average of 1 — Bi(t)/Bs(t) as 

OLi. 

Throughout this paper, we consider an exhaustive search 
under global channel state information (CSI) and buffer state 
information (BSI) for all proposed schemes to obtain the 
optimal performance in RS. A discussion on the complexity of 
our proposed schemes for both RS and BF aspects is provided 
in Section ITV-FI 

The conventional centralized/distributed RS approaches 
ED, E2, E3 can be applied for implementation of the 
proposed RS schemes. To reduce the amount of CSI feed¬ 
back distributed RS approaches are more desirable than the 
centralized approach in practice. For example, first of all, the 
relays can estimate channels based on orthogonal pilot signals 
from the source and the destination at the same time. Then, 
the relays can estimate inter-relay channels in a round robin 
manner using orthogonal MIMO pilot signals. If BSIs and 
CSIs for {<S— >TZ} links (i.e., li^’s) are shared among the 
relays, each relay is able to select the local-best receiving 
relay by regarding itself as the transmitting relay. Similarly 
to the timer-based distributed RS in G3, S3), each relay 
sets a timer based on an inverse of its local-best objective 
function value and sends a request-to-send message after timer 
expiration. Then, the destination sends back a clear-to-send 
message for the earliest access relay. If the relay receives the 
message, it broadcasts the RS information to all the nodes. 
Afterwards, the source and the transmitting relay start to 
transmit their own packets. Through this procedure, the best 
relay pair can be determined in a distributed manner. The 

2 Since the proposed schemes require up-to-date CSI, they are primarily 
applicable in scenarios with low mobility, relative to the symbol duration, 
such as typical WiFi deployments. 
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detailed implementation issues are beyond the scope of this 
work. 

The instantaneous rates in (ITD are determined by effective 
SINR/SNRs at the receiving and transmitting relays, depending 
on the transmit and receive beamformers. Thus, the transmit 
and receive beamformers have to be determined separately for 
each candidate pair of relays. However, finding the optimal BF 
vectors for every given relay pair is non-convex and therefore 
we propose an iterative optimal BF-based RS scheme. Since 
the iterative solution requires a high computational complexity, 
several low-complexity suboptimal BF-based RS schemes are 
also proposed in the rest of this section. Note that the sub- 
optimal schemes have less complexity in the BF design than 
the optimal scheme but the same RS protocol complexity. (See 
Section irV-FI ) For simplicity, we omit the time slot index t in 
the relay indices i(t) and j(t) hereafter. 

A. Proposed Optimal Beamforming-based Relay Selection 
Scheme 

Obviously, if there is no IRI, maximal ratio combining 
(MRC) at the receiving relay and maximal ratio transmit 
(MRT) BF at the transmitting relay, named IRI-free BF, are 
optimal. However, this idealized IRI-free BF is not optimal in 
the presence of IRI. In this subsection, we propose an iterative 
optimal BF-based RS scheme to maximize the average end- 
to-end rate. Denoting /( 7) = log 2 (l +7), the optimization 
problem (fl 2 l > for given {q,} and for each given relay pair (i.j) 
is 


, pslu^hsil 2 , 

where 7 = - n H -tt— 1 T 2 and 7 i"D = 

' png" wjwygji+lluiH 2 '■i 1 ' 

(>'R. hj-ow ; w ■ ; h y -p. Hence, the KKT conditions give 
(AI + pgjigfi) w j = (1 - a j )f('y jT >)pnhjvhf T> w j , (20) 


where p = 


has the form 


(pk gjVi w 3 " gji + II Ui II 2 


p-ji. Thus, the optimal w j 


w j = c w (AI + pgpg 


U d, 


( 21 ) 


for some values of the positive real-valued parameters A and p, 
and scaling constant c w . Using the matrix inversion lemmafl 
(ED is rewritten as 


(a 1 I — A 1 g ji (gf l g jl + X/p) 1 gf^j 


1 jD 


Cw I , 

= T ^ ~ (yttfr 

\ S-ii Sji 


C,r / , 

A \ JP gjigji 



1 - 


gjfgji \ gj! h jp 

gfigji + A/p) g'!g ; , 


r>ji oji 


( 22 ) 


j (16) 

S. t. ||Ui || < 1, || Wj || < 1. 

Since this optimization problem cannot be solved directly 
due to non-convexity, we propose an alternating optimization 
which iterates between (i ) for fixed wj, optimizing u,, and (ii) 
for fixed u,;, optimizing wy. 

For given wj, the optimal receive beamformer u, is obvi¬ 
ously given by the MMSE solution, 

Ui = c u (/»/cH.,w,w' / n(( + I) 1 h Si , (17) 

where the scaling factor c u is selected such that ||ui|| = 1 . 

For given u,, optimizing the transmit beamformer w j is a 
non-convex problem. Denote gp = H((u,. The Lagrangian of 
the optimization problem is 


max 

{ui.Wj.i.j} 


af 


fpn H, 


ps|uf lisd 2 


WiW^H 1 


;+!)ui 


+ (i - a)/ (pnhf^ 


£{wj,\) = af 


Ps |ufh 5 . 


Pv. gj[ W ;i g j 


U, 


+ (! - <*)/ h jX >) + A (l - wf Wj) 

where the gradient with respect to w ; is obtained by 

Vw, -£( w ji A ) 

a/'( 7 Sz)Fs|ufh 5i | 2 0 _ 

rr "" it PKni’gj, w j 

(PKgfiWjW? Sji + ||Ui|| 2 ) 

+ (1 - rff'ilj-D^pTzhj-nhfvWj - 2X\Vj, 


(18) 


(19) 


where wj is defined as the projection of h j-p onto the 

orthogonal subspace of g- Jt and wj is defined as the projection 
of h j-p onto g ji. Let us define the corresponding normalized 

vectors w, = w j i and w 2 - = .j" 3 , .. . Since ||w,-|| = 1 at the 

J ||wj || 3 ll w i II " 

optimum and the scaling constants in (l22l) are real valued and 
positive, the optimal w j has the form 

w j = fiwj + y/l — /? 2 w^“, (23) 

for some value of € [0,1]. Thus, the optimization problem 
at hand is reduced to find the optimal /3 parameter for given 
{i, j) relay pair. Substituting (ED into the original objective 
function gives 

, ( Ps|ufh 5l | 2 \ 

max at - n - 

^ 6 [° 7 ] \pw| 0 g£ w J | 2 + IHIV 

+ (! - a)f [pn\^%w) + yi^h-w-l 2 ) , 

(24) 

where /( 7) = log 2 (1 + 7). In general, this objective function 
may have multiple optima in the interval [0,1] with respect 
to /3. However, a rough grid search in this interval and a few 
Gauss-Newton steps starting at the optimum grid point after 
the grid search can quickly find the global optimum. Note that 
(ED is easily vectorized which can speed up the grid search. 

3 L4 - UD~ 1 V)~ 1 = A- 1 + A~ 1 U (D - VA-W)- 1 VA -1 (45). 
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Equations (l22l > and ( 124b are iterated to find the optimal 
beamformers pair until a fixed number of iterations is reached 
or a certain stopping condition is satisfied^] 

Let the optimized beamformers and (3 parameter be denoted 
by u*, w*, and /3*. Then, the instantaneous SNRs for the 
{5— >i} and {j^V} links are expressed, respectively, as: 




As|u* H h 5l | 




*"$ w / 


= PK 




+ y/l - 


(25) 


(26) 


where wj = /3*w|-* + i/l — (3* 2 wj-*. Hence, substituting (f25l> 
and (l26l > into ([ 6 ]) and ([7]), respectively, the best relay pair is 
selected by ( IT5l >. 


B. Proposed Relay Selection Scheme with Zero-Forcing Beam¬ 
forming (ZFBF)-based IRI Cancellation 

In this subsection, we propose to optimize a transmit 
beamformer based on zero-forcing (ZF) at the transmitting 
relay. First of all, we use the MRC beamformer for the 
receiving relay i, i.e., u, = and then maximize the 

effective channel power gain of the link under a ZF 

condition. Therefore, for a given relay pair the following 
optimization problem is formulated: 

max Iw^hjx *! 2 (27 a) 

s. t. ufH^Wj = 0, (27b) 

||wj|| = l. (27c) 

Let Vji G be a matrix whose columns span the 

null-space of g ji = Hj-u ( . Then, any BF vector w j fulfilling 
the first constraint in (127bl) can be written as w t = V-^z, 
where z G ^(M-i)xi Hence, the optimization problem is 
reformulated by 

max |z ff V^h 7 x >| 2 (28a) 

Z J 

s. t. ||Vjjz|| = 1 , (28b) 


which is equivalent to 
max 

Z 

S. t. 


z //V !'h/phjpV,,z 
Z ff VfV 3 ,z 
IIVj-izH = 1. 


(29a) 

(29b) 


The solution of this problem is z* = c z ( Vj j V 7 ,)“ 1 h 7 x>, 
resulting in w* = c z V ji (V^V ji ) _1 -V? h jT> = c z w*, where 

the scalar c z is chosen so that ||w*|| = 1 , i.e. w* = .Tl. in 
which 


= V;»(V (( Vj) 'V"h, 7 , = 

= li p - cH"h Sil 


\ g"g ji J 


hj V 


(30) 


4 In numerical results, we used a stopping condition where the differences 
in vector norms should be under a certain error tolerance (e*), i.e., ||uj(n) — 
Ui(n — 1)|| < et and ||wj(n) — w j(n — 1)|| < et for the n-th iteration. 
For all results, we set et = 10“ 4 and then the number of iterations is about 
several hundreds depending on channel realizations. 


where the scalar value c = 


h" H, 


since u 7 ; = This 


optimum solution implies a projection of h ; p onto the null- 
space Vji. 

Therefore, substituting u, and w j into (|4ji and (0, the 
instantaneous SNRs for the {5—>7} and links are 

expressed, respectively, as: 


4?V) = Ps I 


LSi| 


7 { £\t) = 


Pn | ||njp| 


h ? P - cH"h 5 


(31) 

(32) 


where the scalar values c = hsiHj * h3 ” 


h«H 


and c = 

and d32l ) into 


|hj 


H, • 

6 J) and Q, 


As a result, substituting yT 
respectively, the best relay pair is selected by ®. 

Proposition 1: The proposed ZFBF-based RS scheme 
asymptotically achieves the average end-to-end rate of ideal 
FD relaying with probability one as the number of antennas 
(M) goes to infinity. 

Proof: Let (H) m denote the m-th column vector of the 
matrix H and (H) mj ; denote the (m, ()-th element of the 
matrix H. Denoting g jt = [ 3 L 3 L... ,g*(} T in g% = 


(Hj») 


H 


U, 


CAA(0, £ 7 ^) since (H 


and ||uj|| = 1. Assuming = 1 without loss of gener- 


CA/”(0, OppJ 


ality, 5 ™ - CAf(0, 1 ) and therefore g"g\, = E™=i Iff ™! 2 
follows a chi-squared distribution with 2 M degrees of free¬ 
dom, i.e., gjfgji ~ X 2 m- Meanwhile, the diagonal elements, 
(gjigT )m,m — |< 7 ^| 2 , follow an exponential distribution with 
parameter one, i.e., (g i? g~ Exp(l),Vro G {1,... ,M} 
and the off-diagonal elements, (gjigjf) m ,!, Vm ^ l, follow the 
distribution of a product of two independent Gaussians with 
zero mean and unit variance (see 06 l)- The distribution of 
each element in g 7 ? gj[ is not varying with respect to M while 
only the size of matrix grows according to M. As a result, 
as M — > 00 , the denominator gjjg ;? goes to infinity, while 

all the elements in the numerator g ?) ;g(( remain as constant 

Jh 


SjiS, 


values with respect to M. Hence, — 


Accordingly, w* 


SfS ji 

and 7 fy\t) 


0 as M 


00. 


-f Pii\\hjv\\ 2 as 


'0 ' ||hjx>|| 

M —>• 00, which completes the proof. 

Remark 1: The proposed ZFBF-based RS scheme also ap¬ 
proach the average end-to-end rate of ideal FD relaying as 
the number of relays ( K ) goes to infinity due to increased 
selection diversity. However, increasing the number of relays 
cannot guarantee to achieve the performance of the ideal FD 
relaying as in Proposition [T| since its selection diversity is 
always less than or equal to that of the ideal FD relaying. In 

S H 

other words, the best transmitting relay should meet f 1 = 0 
for achieving the same performance as the ideal FD relaying. 


0 with 


Although there exist certain relays satisfying s j) S| * 

SjiSji 

high probability as K goes to infinity, they are a subset of the 
set of relays while the ideal FD relaying can always take the 
full selection diversity due to no IRI assumption. 

Remark 2: If the ZFBF solution in (f!T()b is set to the 
initial vector for w j in the first step of the proposed it¬ 
erative optimal BF, it always yields a better solution than 
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the ZFBF, regardless of the number of iterations, since 


PS hfijhlf u 


= argmax 


psuf h Si hg. 


7777 


which yields exactly the same BF pair of the ZFBF- 
based RS scheme and an additional iteration gives a better 
solution. We therefore propose to initialize the alternating 
optimization in this way, even though there are no guarantees 
that this will provide the global optimum. 


C. Proposed Relay Selection Scheme with Minimum Mean 
Square Error (MMSE)-based IRI Suppression 

In this subsection, we propose a receive beamformer based 
on minimum mean square error (MMSE) for maximizing the 
effective SINR at the receiving relay. First of all, we use the 
MRT beamformer for the transmitting relay j, i.e., w 7 = 
, and then we find a receive beamformer for maximizing 
the effective SINR at the receiving relay. Therefore, for given 
relay pair (i, j) and w,. the following optimization problem 
is formulated: 


AsufhsjhgjUi 

max —-- —— -r— 

u, uf (preHj-iWj-wf Hjf + I) 11 , 

S. t. ||u»|| = l. 


(33a) 

(33b) 


The solution of (l33l i is given by the scaled MMSE as u; = 
c m ( pn H;,; Wj v/j 1 H j- + I) h 5i where the scaling factor c m 
is chosen such that ||iij|| = 1. 

Therefore, substituting u, and w j into © and ©, the 
instantaneous SNRs for the {<S —>• /} and {}—>23} links are 
expressed, respectively, as: 

lif (t) = Ps hf, + i) h Si , 

( 34 ) 

= Pn\\hjT>\\ 2 - ( 35 ) 

Substituting (l34l > and (|35T> into © and ©, respectively, the 
best relay pair is selected by O- 

Proposition 2: The proposed MMSE-based RS scheme 
asymptotically achieves the average end-to-end rate of ideal 
FD relaying at low SNR. 

Proof: As p R -> 0, (f) -> psh^I^hst = 

Asl|h<si|| 2 > which completes the proof. ■ 


D. Proposed Relay Selection Scheme with Orthonormal Basis 
(OB)-based IRI Cancellation 

In this subsection, we propose a perfect IRI cancellation 
BF based on orthonormal basis vectors. To this end, we first 
generate two random orthonormal vectors u and q, i.e., u ;/ q = 
0, ||u|| = 1, and ||q|| = 1. Then, we use u as the receive 

H 7.^q 

beamformer at the receiving relay and w, = -— M as the 

ll H ji nil 

transmit beamformer at the transmitting relay, respectively. 

Since q = 0, llull = 1, and 

11 Wj 11 = 1 , substituting u and w j into © and ©, the 


instantaneous SNRs for the (5—>2} and { 7 —>29} links are 
expressed as: 

= Ps\hsi\ 2 , (36) 

l ( tv{t) = p n \h j - D \\ (37) 

where h$i = u ff h,s ? ; ~ CJ\f(0, cr 2 sv ) and hjv = h^w j ~ 
CN{Q,a RT >). As a result, substituting (f36t and (1371) into © 
and ©, respectively, the best relay pair is selected by (IT?] ). 

Remark 3: From (l36l > and (l37l >. the proposed OB-based 
RS scheme achieves the average end-to-end rate of ideal FD 
relaying with a single antenna at the relays. 

Remark 4: The proposed ZFBF-based RS scheme is always 
better than the proposed OB-based RS scheme in terms of the 
average end-to-end rate while both of the schemes perfectly 
cancel IRI. However, it is not always better in the viewpoint 
of instantaneous RS and BF, since the randomly chosen 
beamformers in the proposed OB-based RS scheme might be 
better at a certain RS instance for not the {5—>-7?.} link rate 
but the weighted sum-rate. 


E. Proposed SINR-based Relay Selection Scheme with Beam¬ 
forming Neglecting IRI 

Although the IRI-free BF is not optimal in the presence of 
IRI, we propose to use them and utilize effective SINR/SNR 
measures after BF in RS as the simplest joint RS and BF 
scheme. Accordingly, for relay pair ( i, j), the receive BF vector 
is given by u, = .. and the transmit BF vector is given 

by w j = |‘ 7 ” || . Substituting u, and w j into © and ©. the 
instantaneous SINR and SNR of both the {5—>7} and (j—>27} 
links are obtained, respectively, by 


-& 1} m 
fi!\t) 


||h5i|| 2 p5 

1 + ||h 5 i || 2 ||h^|P^ 

Pn\\^jv\\ 2 - 


(38) 

(39) 


Substituting (l38l > and ( 13 91 into © and ©. respectively, the 
best relay pair is selected by ( IT5t . 


F. Discussion on Complexity of the Proposed Joint Relay 
Selection and Beamforming 

For the RS protocol, an exhaustive search within K x (AT—1) 
combinations with global CSI and BSI is required to obtain 
the optimal performance. Accordingly, the complexity of the 
optimal relay pair selection is 0{I\ 2 ) for all the schemes. 
However, through the distributed RS approach, CSIs for 
{A—>27} and {'IZ^'R.} links can be directly estimated at each 
relay from pilot signals and thus the amount of feedback on 
CSI can be reduced from (2202 + M 2 K(K — l)/2) to KM. 
It is also worth mentioning that the number of available (fixed) 
relays in practical network scenarios would not be so large due 
to geographical limitations. On the other hand, the proposed 
schemes can still be effective even for a two-relay network 
if sufficient number of antennas are available at the relays. 
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Moreover, as will be shown in Section IV-BI the performance 
enhancement from more than five relays would be marginal. 

For BF design for given relay pair, if we denote the com¬ 
plexity for computing a BF vector including M dimensional 
matrices and/or vectors by Cbf, the complexity of all the 
suboptimal BF schemes becomes 2Cbf since they require to 
compute two BF vectors. However, they can have different 
computational times in practice according to degree of matrix 
and/or vector computations. For instance, the MMSE-based 
scheme requires to compute M dimensional matrix inversion, 
while the SINR-based scheme only requires vector normaliza¬ 
tions. On the other hand, the complexity of the optimal BF 
scheme is derived as L(2Cbf + Cp), where L denotes the 
number of iterations and Cp denotes the complexity to find 
the optimal /? including both a rough line search and some 
Gauss-Newton steps. Consequently, the optimal BF scheme 
requires additional complexity for iteration process and an 
additional optimization parameter, compared to the suboptimal 
BF schemes. After all, the suboptimal BF schemes are useful 
for online operation and a network which consists of nodes 
with low computation power. 

V. Performance Evaluation 

In this section, we evaluate the proposed joint buffer-aided 
RS and BF schemes in terms of the average end-to-end rate 
and average delay through Monte-Carlo simulations, compared 
to conventional HD RS schemes and SFD-MMRS scheme 
representing state-of-the-art in the literature. For multiple- 
antenna extension of the conventional schemes, we suppose 
that they use the IRI-free beamformers. As an upper bound of 
the average end-to-end rate, we consider the optimal joint RS 
and BF in (IT2l > assuming no IRI. The optimal weight factors 
{«j.} are adaptively obtained based on the back-pressure 
algorithm during a pre-training phase and applied to each relay 
pair selection. We assume half-full initial buffer state at all 
relays for the pre-training phase but zero initial buffer state for 
data transmission phase. We consider Rayleigh block fading 
channels with average gains of cr| TC ., erjj -p, and cr^ p. for 
i,j £ 1C, 10000 packet transmissions from the source, and 
assume I’s = Pr. throughout all simulations. 


A. Benchmarks 

1) HD Best Relay Selection (BRS) Scheme 071/ : In the 
HD-BRS scheme without buffering at relays, the best relay 
is determined by 

i* = argmax min {C S i (f), C iD (t )} , (40) 

iGK 

where C Sl {t) = \ log 2 (1 + 75t(/)) and C w {t) = 

±log 2 (l + 7 ip(f)). 

2) HD max — max Relay Selection (MMRS) Scheme 07]?: 
In the HD-MMRS scheme maintaining the two-phase opera¬ 
tion, the best relay at first time slot, i*, and the best relay at 
second time slot, j* , are selected as follows: 

i* = argmax Csi{2t), j* = argmax Cjp(2/ +1), (41) 

iG/C j'G/C 


where C Si (2t) = min { \ log 2 (1 + 7^(2/)), B max - Bi(2t - 
1)} and Cjj,(2t+1) = min {| log2(l+7jz>(2i+l)), Bj(2t)}. 

3) HD max — link Relay Selection (MLRS) Scheme 021/ : 
Since the HD-MLRS scheme has been developed for fixed rate 
transmission, we slightly modify it to adaptive rate transmis¬ 
sion by adding a link selection parameter d, for the i-th relay. 
In the conventional HD-MLRS scheme releasing the two-phase 
operation condition, the best relay i* and the best link d,> 
(integer variable; receiving 1 or transmitting 0) at each time 
slot are determined by 

(**, di*) = argmax {d z C S i(t) + (1 - di)C w (t)} , (42) 

ie/C,die{o,i} 

where Csi{t) = min { \ log 2 (1 + 7 S i{t )), B max - Bi(t - 1)} 
and C w {t) = min {| log 2 (1 + 7 w{t )), B^t - 1)}. 

4) SFD-MMRS Scheme 02?: In the SFD-MMRS scheme, 
each {<S—>-72.} or {IZ^-V} link selects the best relay and the 
second best relay based on channel gains without consideration 
of IRI. Denote the relay indices of the best and second best 
relays by i-\ and j 2 for the receiving relay and 1 \ and j 2 for 
the transmitting relay, respectively. Then they are selected as 
follows: 


h = argmax Csi{t ), i 2 = argmax Csi{t), 
*e/C fe/C\{ii} 

ji = argmax Cjvit), j 2 = argmax C jT )(t), 


where C S i(t) = min {log 2 (1 + 7 si(t )), B max - B,(t - 1)} 
and Cjv{t) = min { log 2 (1 + , B 0 (t - 1)}. If the 

best relays for both links are same, it finds the best relay pair 
among combinations with the second best relays based on a 
minimum of achievable rates, i.e., 




(*l! J2), 


if h ± j 1 

if h = ji and min{C' 5 ,; 2 (f), C hT >(t)} 
> min{C 5il (t),Cj 2 z>(i)} 

otherwise. 


(43) 


Since the SFD-MMRS scheme assumes a single antenna at the 
relays and no IRI, we extend it to multiple antennas using the 
IRI-free BF, but include the actual IRI in the results marked by 
“SFD-MMRS-IRI”. The performance degradation of the non¬ 
ideal SFD-MMRS scheme due to IRI is shown as numerical 
results in the following subsections. 


B. Average End-to-End Rate 

1) i.i.d. Channel Case (cr^-p. = = a Ti K = 

0 dB,V/,j £ K.): Fig. [2] shows the average end-to-end rate 
for varying SNR when K = 2, M = 2, B max —>• 00, and 
the average channel qualities of all the links are identical. 
The ideal SFD-MMRS scheme performs close to the upper 
bound obtained by <fl2l> . This validates that our weighted sum- 
rate maximization based on instantaneous rates works well 
to maximize the average end-to-end rate. If we impose IRI 
into the SFD-MMRS scheme, its performance is significantly 
degraded with increasing SNR, i.e., in the interference-limited 
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0 5 10 15 20 25 30 


SNR (dB) 

Fig. 2. Average end-to-end rate vs. SNR for i.i.d. channel case (K = 2, 
M = 2, Bmax -A OO, = o dB,Vi, j G AC) 

regime. Although the proposed SINR-based RS scheme im¬ 
proves the average end-to-end rate, its contribution is not 
significant at medium/high SNR. On the contrary, the average 
end-to-end rates of the other proposed schemes still increase 
with increasing SNR due to IRI cancellation/suppression. Since 
the proposed OB-based RS scheme achieves the single antenna 
upper bound, the proposed optimal BF-based, ZFBF-based, 
and MMSE-based RS schemes outperform the single antenna 
upper bound. While the optimal BF optimizes both transmit 
and receive beamformers iteratively, the ZFBF and MMSE 
BF optimize just one of the beamformers fixing the other 
beamformer. As a result, the ZFBF-based and MMSE-based 
RS schemes achieve lower average end-to-end rates than the 
optimal BF-based RS scheme. Moreover, the MMSE-based 
RS scheme achieves better performance at low/medium SNR 
than the ZFBF-based RS scheme, since the MMSE BF can 
achieve the ideal upper bound at low SNR regime as proved 
in Proposition [2] 

Regarding the conventional HD RS schemes, the HD-MLRS 
and HD-MMRS schemes outperform the HD-BRS scheme 
since they additionally utilize buffering at relays. Furthermore, 
the HD-MLRS scheme outperforms the HD-MMRS scheme 
since it obtains more diversity gain by releasing the two-phase 
operation condition. Compared to the HD RS schemes, the 
slopes of curves of the proposed optimal BF-based, ZFBF- 
based, and MMSE-based RS schemes are almost double. For 
instance, at SNR = 30 dB, the proposed optimal BF-based RS 
scheme excesses twice the average end-to-end rate by the HD- 
BRS scheme and the proposed ZFBF-based and MMSE-based 
RS schemes achieve slightly less performance than it. 

Rg. a shows the average end-to-end rates for varying 
number of antennas at the relays when K = 2, SNR = 20 
dB, B max —>• oo, and i.i.d. IRI channel conditions. Except 
for the proposed OB-based RS scheme, the other schemes 
basically increase the average end-to-end rate as the number 
of antennas increases. The proposed optimal BF-based, ZFBF- 
based, and MMSE-based RS schemes converge to the upper 
bound when increasing the number of antennas, as shown in 



Fig. 3. Effect of the number of antennas (K = 2, SNR = 20 dB, B max —> 
°°’ a STii = r7 v. j T> = a KjiZi = 0 dB,Vi,.j G AC) 


Proposition |T] for the ZFBF-based RS scheme and stated in 
Remark |2] for the optimal BF-based RS scheme. Eventually, 
it is worth noting that the proposed optimal BF-based, ZFBF- 
based, and MMSE-based RS schemes can fully recover the 
loss of multiplexing gain caused by the HD relaying as the 
number of antennas increases. Even if the number of antennas 
increases, the average end-to-end rates achieved by the SINR- 
based RS scheme and the non-ideal SFD-MMRS scheme are 
still less than those by the HD RS schemes due to the effects 
of strong IRI and limited selection diversity. 

Fig. Q] shows the average end-to-end rate for varying the 
number of relays. All the schemes achieve improved average 
end-to-end rates due to the increased selection diversity gain 
as the number of relays increases. Differently from in Fig. 0 
the proposed SINR-based RS scheme obtains significant gains 
with increasing number of relays. Accordingly, it outperforms 
the conventional HD RS schemes when K > 3. In contrast, the 
non-ideal SFD-MMRS scheme is still the worst with marginal 
improvements. This is because it never considers IRI at all 
while the SINR-based RS scheme takes IRI into account at the 
RS. The optimal BF-based, ZFBF-based, and MMSE-based RS 
schemes also approach the upper bound as the number of relays 
increases. Thus, they can asymptotically recover the loss of 
multiplexing gain with respect to the number of relays even if 
the exact convergence is not guaranteed in this case. However, 
it is shown that the rate improvement is much slower than 
when increasing the number of antennas. When K = 10, all 
the proposed schemes except for the SINR-based RS scheme 
achieve greater than or equal to double the average end-to-end 
rate of the HD-BRS scheme. 

2) i.i.d. Channel Cases with Different IRI Intensities: To 
investigate the effect of average IRI intensity, we consider two 
different average IRI channel conditions: (i) weak IRI case 
OI.-r., = -10 dB ,a 2 sn . = = 0 dB,Vi,j G AC); (ii) 

strong IRI case = 10 dB, <Jg U = a ^ v = 0 dB, Vi, j G 
AC). 

Fig. [5] (a) shows the average end-to-end rate for weak IRI 
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Fig. 4. Effect of the number of relays (M = 2. SNR = 20 dB, Bmax > oo, 
a %Tli = ^KjT) = ^TljTZi = 0 dB,Vi,J € /C) 


case when if = 3, M = 4, and B maa; —► c». When if = 3 
and M = 4, the proposed optimal BF-based, ZFBF-based, 
and MMSE-based RS schemes already almost approach the 
upper bound regardless of SNR. The optimal BF-based RS 
scheme outperforms the other schemes and the MMSE-based 
RS scheme slightly outperforms the ZFBF-based RS scheme 
at low SNR. While the OB-based RS scheme still achieves 
the single antenna upper bound regardless of the average IRI 
intensity, the SINR-based RS scheme and the non-ideal SFD- 
MMRS scheme always outperform the HD RS schemes and 
outperform the OB-based RS scheme at low/medium SNR. 
Especially, when SNR = 0 dB, the non-ideal SFD-MMRS 
scheme achieves almost the same performance as the ideal 
upper bound since this channel condition yields very weak 
interference which is negligible. 

Fig. [5] (b) shows the average end-to-end rate for strong IRI 
case in the same setup. The proposed ZFBF-based and OB- 
based RS schemes yield exactly the same performance as in 
Fig- 12] (a) since they do not depend on the intensity of IRI due 
to perfect IRI cancellation. The proposed MMSE-based RS 
scheme achieves almost identical performance as the proposed 
ZFBF-based RS scheme even at low SNR since the intensity 
of IRI is already strong compared to the {5— >1Z } channel con¬ 
ditions. The optimal BF-based RS scheme performs between 
the upper bound and the MMSE-based and ZFBF-based RS 
schemes regardless of the intensity of IRI and SNR. In contrast, 
the non-ideal SFD-MMRS scheme significantly degrades the 
average end-to-end rate which is always worse than those of 
the HD RS schemes. 

3)non-i.i.d. Channel Case: As a more practical net¬ 
work scenario, we consider a network with three relays 
where the links have different average channel gains as 
bsTCii a sn 2 ’ °stc 3 ] = [IjOj—1] ^B, [cr^p, <t iz 3 v\ = 
[-1,0,1] dB, and = a n 2 n 3 = a n 3 n 2 = 

0 dB, ct^ i 7 j 3 = = —1 dB. In this setup, {S — 1Z 2 — T>} 

are equally spaced, IZi is closer to S and TZ 3 is closer 
to V. {Tfi — ll 2 — 7f3} are equally spaced and therefore. 



SNR (dB) 
la) 



SNR (dB) 
(b) 


Fig. 5. Effect of average IRI intensity (K = 3, M = 4, Bmax —► 
oo, = a iz T> = 0 dB,Vi,j G /C) (a) Weak IRI ( 0 ^ . 7 ^. = 

— 10 dB, \/i,j G /C) (b) Strong IRI ^ = 10 dB,Vz,j G /C) 

{TZi — 7f 3 } becomes a less interfered channel than {Hi - H 2 } 
and {72-2 — 7?:i}. Fig. [6] shows the average end-to-end rate 
with various SNR values for the non-i.i.d. channel case. 
The basic trend is the same as that for i.i.d. channel cases 
shown previously. Hence, our proposed weighted sum-rate 
maximization approach works well also under non-identical 
channel condition. Furthermore, the proposed optimal BF- 
based, ZFBF-based, MMSE-based RS schemes are still effec¬ 
tive for generalized asymmetric network topologies. 


C. Delay Performance 

Basically, the buffer-aided relaying obtains additional se¬ 
lection diversity gain by sacrificing delay performance. Even 
if we mainly focus on the average end-to-end rate of delay- 
tolerant applications, we evaluate the average delay perfor- 
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Fig. 6. Average end-to-end rate vs. SNR for non-i.i.d. channel case (K = 3, 
M = 2, j i °57j 3 ] = [1,0,—1] dB, [cr^ lT> , = 

[-1,0, 1] dB, ct^ i7 j 2 = o-^ 2Ki = ct|j 27 j 3 = &K 3 K 2 = 0 dB ’ ff RiR 3 = 
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mance through simulations. Due to a full-queue assumption at 
the source, the average delay is defined as the average queueing 
delay at relays, which implies the time difference between the 
arrival time of a single packet at a relay and the successfully 
received time of the packet at the destination in number of time 
slots (number of channel uses). A single packet is transmitted 
from the source at each time slot and its size is determined by 
the selected {<S— >TZ} channel gain according to adaptive rate 
transmission. Therefore, a delay of one means that a packet is 
stored at the relay in a certain time slot and all information 
bits contained in the packet are successfully forwarded to the 
destination in the next time slot. As stated in Section [V] we 
also consider 10000 packet transmissions from the source for 
the delay performance in following. 

In Fig. [7] we show the average delay for varying number 
of antennas when K = 2, SNR = 20 dB, B max —>• 00, 
and v'sK-i = °n 3 v = = 0 dB,Vi, j G K. Except 

for the proposed OB-based RS scheme, the average delays of 
all the schemes decrease as the number of antennas increases 
since the effective channel gain increases with the number of 
antennas. In the proposed OB-based RS scheme, the average 
delay is varying with the values slightly less than 50 time slots 
because its effective channel gain is same as the single antenna 
case. The ideal SFD-MMRS scheme has moderate average 
delay and the non-ideal SFD-MMRS scheme and the SINR- 
based RS scheme have very short delays approaching one. The 
reason is that both schemes have the bottleneck in {<S—^7?.} 
link due to uncoordinated IRI under i.i.d. channel condition 
and thus the size of packets transmitted from the source to 
a relay is much smaller than average {1Z -A 77} link rate. On 
the contrary, the proposed optimal BF-based, ZFBF-based, and 
MMSE-based RS schemes have longer delays than that of the 
ideal SFD-MMRS scheme. In particular, the proposed ZFBF- 
based RS scheme has a significant delay when M = 2 since 
its bottleneck is the {1Z—>T>} link while the bottleneck of 
the MMSE-based RS scheme is the {5— >1Z} link in average 
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sense. After all, the average packet size from the source in 
the ZFBF-based RS scheme is larger but its average {1Z—>V} 
link rate is smaller than those of the MMSE-based RS scheme. 
However, the ZFBF-based RS scheme recovers the loss in the 
effective SNR at the destination and thus the average delay is 
significantly reduced when M > 4. Similarly, in the optimal 
BF-based RS scheme, both average {5—1-7?.} and {7Z—>V} 
link rates are more balanced than those in the MMSE-based 
RS scheme. This results in a larger average {5— >7Z} link rate 
and a smaller average {1Z—>V} link rate for the optimal BF- 
based RS scheme, compared to the MMSE-based RS scheme. 
Accordingly, the average delay of the optimal BF-based RS 
scheme becomes worse than that of the MMSE-based RS 
scheme, since its average packet size is larger but its average 
{IZ^yU} link rate is smaller than those of the MMSE-based 
RS scheme. 

Fig. |8] shows the average delay for varying number of relays 
in the same setup. The basic trend of the ideal SFD-MMRS 
scheme increases as the number of relays increases since the 
more relays exist, each relay is less likely to be selected. 
The SINR-based RS scheme follows the similar trend while 
the non-ideal SFD-MMRS scheme has still very short delays 
approaching one. This is because the SINR-based RS scheme 
can provide much larger average link rates as the number 
of relays increases as shown in Fig. [4] The other proposed 
schemes optimizing beamformers have a different trend where 
the average delay decreases until K = 3 (K = 5 in the ZFBF- 
based RS scheme) and increases again as the number of relays 
increases further. When K = 2 and M = 2, the minimum 
network setup, the degrees of freedom optimizing transmit 
and/or receive beamformers is too restricted but one additional 
relay gives an additional degree of freedom improving link 
rates. Hence, the average delays of the proposed schemes are 
rather decreased when K = 3, but increased again after then, 
since the additional gain is almost saturated after a certain 
moment compared to the reduced selection opportunity per 
relay. From Figs. [7] and [8] since the proposed ZFBF-based 
RS scheme requires too much delay at the minimum network 
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setup (K = 2, M = 2), the MMSE-based RS scheme is more 
desirable for this setup in the perspective of the average delay 
performance. 


D. Behavior of the Optimal Weight Factors 

In this subsection, we investigate the behavior of the optimal 
ctk parameters for the proposed schemes in two different 
channel cases: i.i.d. and non-i.i.d. channel cases with three 
relays. In simulations, to find the optimal a k values, we 
employ the back-pressure algorithm approach using a pre¬ 
training phase as stated in (IT5l) . Fig. [3 shows the optimal otk 
values obtained after the pre-training phase when K = 3 and 
M = 2. 

Fig. 0 (a) shows an i.i.d. channel case where all the links 
have an identical average channel quality as 0 dB. In the 
figure, all ak values for each scheme are identical due to 
the i.i.d. channel condition. In principle, if ctk is close to 
one, the {5— >1Z} link rate dominates the cost function and 
if o-f; is close to zero, the {IZ^V} link rate dominates the 
cost function in the relay pair selection. The upper bound and 
the optimal BF-based RS scheme balance both link rates as 
a* k ss 0.5 since the optimum is achieved when E[C's7j(£)] = 
E[CW> (£)]■ In contrast, the optimal ctk values of the proposed 
MMSE-based and SINR-based RS schemes are close to one, 
while that of the proposed ZFBF-based RS scheme is close 
to zero. This is because the bottleneck of the MMSE-based 
and SINR-based RS schemes is the {<S— >1Z} link while the 
bottleneck of the ZFBF-based RS scheme is the {1Z—>V} link. 

Fig. 0 (b) shows a non-i.i.d. channel case, the same setup 
in Fig. [6] For all the schemes, o fs have different values 
according to average channel quality and 0:2’s have almost 
the same values as those for the i.i.d. channel case since 72-2 
has the same average channel quality as the i.i.d. channel 
case. In the figure, it holds that a\ < 0:2 < 0:3 because 

> a STZ 2 = a Ti 2 V^ an d a STZ 3 < a n 3 Tn an ^ 

the optimal at parameter tends to be determined in order to 
balance both {5— >1Z} and {7Z—>V} links. As SNR increases, 




Fig. 9. Optimal a value after a training phase ( K = 3, M = 2) 
(a) i.i.d. channel case = rrf , D = rrf = 0 (IB. Vz, j G 

K) (b) non-i.i.d. channel case ([ 057 ’ °S 7 j 2 ’ °S 7 J 3 ] = H — 11 dB, 
cr TZ 2 'D’ a TZ 3 'Di = [ _ dB, = CT 7J. 2 7J 3 = 

a K 3 iz 2 = 0 cr iZi'iz 3 ~ <7 v. 3 ji 1 = ~ 1 * 111 ) 

the differences among ak values are reduced since imbalance 
effect among links is diminished at high SNR. Similarly to the 
i.i.d. channel case, the upper bound and the optimal BF-based 
RS scheme are biased at ctk = 0.5, and the proposed MMSE- 
based and SINR-based RS schemes are biased to be close to 
one while the ZFBF-based RS scheme is biased to be close to 
zero. 

E. Effects of Finite Buffer Size 

In practice, the buffer size at relays is finite and thus it 
restricts the performance since a full-buffer relay cannot be se¬ 
lected as the receiving relay. We investigate the performance in 
terms of average end-to-end rate and average delay according 
to the finite buffer size. Fig. [TO] shows the average end-to-end 
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Fig. 10. Average end-to-end rate with finite buffer size (K = 3, M = 2, 
SNR = 20 dB, = cr^ 7 j.. = 0 dB, Vi, j S £) 


rate for varying buffer size when K = 3, M = 2, SNR = 20 
dB, and a% Ut = a\. v = a^. n . = 0 dB,V*,j G 1C. All 
the schemes rapidly converge to their own performance upper 
limits with infinite buffer size as the buffer size increases. 
Accordingly, a buffer size B max > 50 bits/Hz is sufficient to 
obtain the performance upper limits with infinite buffer size. 
For instance, when the bandwidth is 10 MHz, the buffer size 
is required to be about 60 MB in order to achieve the average 
end-to-end rate with infinite buffer size. This value is allowable 
at relays in the viewpoint of present memory size. 

Fig. QT| shows the average delay performance with finite 
buffer size. For all the schemes, the average delays converge 
to their own limits as the buffer size increases although the con¬ 
vergence speed is different between the schemes. Interestingly, 
the finite buffer size can be helpful to reduce the average delay 
without a loss in the average end-to-end rate. For example, 
even if the proposed ZFBF-based RS scheme has the longest 
delay, the average delay can be less than 15 time slots with 
achieving the average end-to-end rate limit if the buffer size 
is set to 50 bits/Hz. Therefore, if the buffer size is set to an 
appropriate value, it is enough to achieve near-optimal average 
end-to-end rate under reasonable average delay. 

VI. Conclusion 

In this paper, we proposed virtual FD buffer-aided joint RS 
and BF schemes taking IRI into account in a buffer-aided 
multiple relays network, where each relay is equipped with 
multiple antennas. We first formulated a weighted sum-rate 
maximization based on instantaneous rates maximizing the 
average end-to-end rate. Based on the alternative objective 
function, we proposed various RS schemes based on optimal 
and suboptimal BF designs to cancel or suppress IRI taking a 
trade-off between computational complexity and performance 
into consideration. Through simulations, the proposed joint RS 
and BF schemes were evaluated in terms of the average end- 
to-end rate and average delay, compared to several conven¬ 
tional HD RS and SFD-MMRS schemes. In numerical results. 



Fig. 11. Average delay with finite buffer size (K = 3, M = 2, SNR = 20 
dB. cr%-jz. = a \-T> = a Tl j Tl i = 0 S /C) 


asymptotic trends were investigated with respect to the number 
of relays and the number of antennas at relays. The proposed 
joint RS and BF schemes recover the loss of multiplexing gain 
in the HD relaying even in the presence of IRI as the number 
of antennas and/or the number of relays increase. Although the 
complexities of the proposed joint RS and BF schemes are high 
due to global CSI required, they can be useful benchmarks in 
system simulations. In addition, the behavior of the optimal 
weight factor and the effects of finite buffer size were shown 
in various different network setups. Basic trend in the optimal 
weight factor is moving toward to reduce the link rate gap 
between the bottleneck link and the other link. Although the 
finite buffer size limits the average end-to-end rate, it can 
help to bound the average delay with achieving near-optimal 
average end-to-end rate if it is set to an appropriate value. 
For future studies, it is possible to extend to multiple source- 
destination pairs with multiple antennas, to apply for non-full 
queue traffic at the source, to consider imperfect CSI and BSI, 
to develop a low complexity and limited feedback RS scheme, 
and to apply for other applications such as cognitive radio and 
physical layer security. 
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